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Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: |S [pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 



Computing curricula 2001 

September 2001 Journal on Educational Resources in Computing (JERIC) 

Full text available: pdf(61 3.63 KB) 
m html(2.78 KB) 



Additional Information: full citation , references , citings , index terms 



An empirically based system for processing definite descriptions 
Renata Vieira, Massimo Poesio 

December 2000 Computational Linguistics, volume 26 issue 4 
Full text available 



pdf(2.94 MB) Additional Information: full citation , abstract , references 

Publisher Site 

We present an implemented system for processing definite descriptions in arbitrary 
domains. The design of the system is based on the results of a corpus analysis previously 
reported, which highlighted the prevalence of discourse-new descriptions in newspaper 
corpora. The annotated corpus was used to extensively evaluate the proposed techniques 
for matching definite descriptions with their antecedents, discourse segmentation, 
recognizing discourse-new descriptions, and suggesting anchors for brid ... 

Clustering: Probabilistic combination of text classifiers using reliability indicators: 
models and results 

Paul N. Bennett, Susan T. Dumais, Eric Horvitz 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: Ijjjl pdfn 26.99 KB) Additional Information: full citation , abstract , references , index terms 
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The intuition that different text classifiers behave in qualitatively different ways has long 
motivated attempts to build a better metaclassifier via some combination of classifiers. We 
introduce a probabilistic method for combining classifiers that considers the context- 
sensitive reliabilities of contributing classifiers. The method harnesses reliability indicators— 
variables that provide a valuable signal about the performance of classifiers in different 
situations. We provide backgrou ... 

Keywords: classifier combination, metaclassifiers, reliability indicators, text classification 



5 A structural view of the Cedar programming environment 

Daniel C. Swinehart, Polle T. Zellweger, Richard J. Beach, Robert B. Hagmann 

August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 8 Issue 4 

i- it* ^ i ui 0 -ir/e Additional Information: full citation , abstract , references , citings , index 

Full text available: 153 pdf(6,32 MB) 

LC - r terms 

This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are organized. 
Cedar supports the development of programs written in a single programming language, 
also called Cedar. Its primary purpose is to increase the productivity of programmers whose 
activities include experimental programming and the development of prototype software 
systems for a high-performance personal computer. T ... 

6 Query evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 

i- ii* ^ i ui 0i ,i*vn 0-7 **d\ Additional Information: full citation , abstract , references , citings , index 

Full text available: 153 pdf(9.37 MB) 

terms , review 

Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to 
provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 



7 IS '97: model curriculum and guidelines for undergraduate degree programs in 
information systems 

Gordon B. Davis, John T. Gorgone, J. Daniel Couger, David L. Feinstein, Herbert E. 
Longenecker 

December 1996 ACM SIGMIS Database , Guidelines for undergraduate degree programs 
on Model curriculum and guidelines for undergraduate degree 
programs in information systems, volume 28 issue l 

Full text available: ^ pdf(7.24 MB) Additional Information; full citation , citings 



8 Interactive Editing Systems: Part II Q 
Norman Meyrowitz, Andries van Dam 

September 1982 ACM Computing Surveys (CSUR), Volume 14 issue 3 

Full text available: ^ pdf(9.17 MB) Additional Information: full citation , references, citings , index terms 
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9 Incremental data flow analysis in a structured program editor Q 
Frank Kenneth Zadeck 

June 1984 ACM SIGPLAN Notices , Proceedings of the 1984 SIGPLAIM symposium on 

Compiler construction, Volume 19 issue 6 
Full text available: ^ pdf(1.03 MB) Additional Information: full citation , abstract , references , citings 

For many years, the generally accepted methods for solving global data flow analysis 
problems (GDFAPs) have relied on the bit vector paradigm [Alle 70] [Kenn 81]. In these 
algorithms, the information for each variable is represented as a position in a bit vector. 
These vectors are propagated from node to node during the analysis of the program.This 
paper investigates a different paradigm for solving GDFAPs. In this strategy, the data flow 
solution for each variable is computed independent ... 

10 Program Transformation Systems Q 
H. Partsch, R. Steinbruggen 

September 1983 ACM Computing Surveys (CSUR), Volume 15 issue 3 

Full text available: ^ pdf(3.00 MB) Additional Information: full citation , references , citings , index terms 



11 Curriculum 68: Recommendations for academic programs in computer science: a 
report of the ACM curriculum committee on computer science 

William F. Atchison, Samuel D. Conte, John W. Hamblen, Thomas E. Hull, Thomas A. Keenan, 
William B. Kehl, Edward J. McCluskey, Silvio O. Navarro, Werner C. Rheinboldt, Earl J. 
Schweppe, William Viavant, David M. Young 
March 1968 Communications of the ACM, Volume n issue 3 

Full text available: ^ pdf(6.63 MB) Additional Information: full citation , references , citings 



Keywords: computer science academic programs, computer science bibliographies, 
computer science courses, computer science curriculum, computer science education, 
computer science graduate programs, computer science undergraduate programs 



12 Data base directions: the next steps Q 
John L. Berg 

November 1976 , Volume 8 , 8 Issue 4 , 2 

Full text available: ^ pdf(9.95 MB) Additional Information: full citation , abstract 

What information about data base technology does a manager need to make prudent 
decisions about using this new technology? To provide this information the National Bureau 
of Standards and the Association for Computing Machinery established a workshop of 
approximately 80 experts in five major subject areas. The five subject areas were auditing, 
evolving technology, government regulations, standards, and user experience. Each area 
prepared a report contained in these proceedings. The proceedings p ... 

Keywords: DBMS, auditing, cost/benefit analysis, data base, data base management, 
government regulation, management objectives, privacy, security, standards, technology 
assessment, user experience 



13 Using thumbnails to search the Web 

Allison Woodruff, Andrew Faulring, Ruth Rosenholtz, Julie Morrsion, Peter Pirolli 
March 2001 Proceedings of the SIGCHI conference on Human factors in computing 
systems 

r- .. 4 ^ u a Additional Information: full citation , abstract , references , citings , index 
Full text available: ^ pdf(424.11 KB) terms 

We introduce a technique for creating novel, textually-enhanced thumbnails of Web pages, 
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These thumbnails combine the advantages of image thumbnails and text summaries to 
provide consistent performance on a variety of tasks. We conducted a study in which 
participants used three different types of summaries (enhanced thumbnails, plain 
thumbnails, and text summaries) to search Web pages to find several different types of 
information. Participants took an average of 67, 86, and 95 seconds to f ... 

Keywords: Web search task, thumbnails 



14 Automatic parsing for content analysis 
Frederick J. Damerau 

June 1970 Communications of the ACM, Volume 13 issue 6 

Full text available: ^pdf(4.07 MB) Additional Information: full citation, abstract , references, citings 

Although automatic syntactic and semantic analysis is not yet possible for all of an 
unrestricted natural language text, some applications, of which content analysis is one, do 
not have such a stringent coverage requirement. Preliminary studies show that the Harvard 
Syntactic Analyzer can produce correct and unambiguous identification of the subject and 
object of certain verbs for approximately half of the relevant occurences. This provides a 
degree of coverage for content analysis variable ... 

Keywords: content analysis, information retrieval, language analysis, natural language 
processing, parsing, syntactic analysis, text processing 



15 Types and persistence in database programming languages 
Malcolm P. Atkinson, O. Peter Buneman 

June 1987 ACM Computing Surveys (CSUR), Volume 19 issue 2 

c ... , i ui « ^/-7 oh Additional Information: full citation , abstract , references , citings , index 

Full text available: I 5q pdf(7.91 MB) a -' 

terms , review 

Traditionally, the interface between a programming language and a database has either 
been through a set of relatively low-level subroutine calls, or it has required some form of 
embedding of one language in another. Recently, the necessity of integrating database and 
programming language techniques has received some long-overdue recognition. In 
response, a number of attempts have been made to construct programming languages with 
completely integrated database management systems. These lang ... 

16 News on-demand for multimedia networks 
Gene Miller, Greg Baber, Mark Gilliland 

September 1993 Proceedings of the first ACM international conference on Multimedia 

Full text available: fiBl pdf(1 06.03 KB) AJJ .. ttr . . . 

p f ps(1.21 1MB) Additional Information: full citation , references , citings , index terms 



Keywords: authoring, database, digital video, information service, messaging, networked 
multimedia, news, retrieval, wideband 



17 Parallel text search methods 
Gerard Salton, Chris Buckley 

February 1988 Communications of the ACM, volume 31 issue 2 

c ... , .. U1 a co fc . DX Additional Information: full citation , abstract , references, citings , index 

Full text available: T Bqpdfn.53 MB) f 

terms , review 

A comparison of recently proposed parallel text search methods to alternative available 
search strategies that use serial processing machines suggests parallel methods do not 
provide large-scale gains in either retrieval effectiveness or efficiency. 
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18 Special issue on persistent object systems: Orthogonally persistent object systems Q 
Malcolm Atkinson, Ronald Morrison 

July 1995 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 4 Issue 3 

Full text available: ^pdf(5.02 MB) Additional Information: full citation, abstract , references, citings 

Persistent Application Systems (PASs) are of increasing social and economic importance. 
They have the potential to be long-lived, concurrently accessed, and consist of large bodies 
of data and programs. Typical examples of PASs are CAD/CAM systems, office automation, 
CASE tools, software engineering environments, and patient-care support systems in 
hospitals. Orthogonally persistent object systems are intended to provide improved support 
for the design, construction, maintenance, and operation o ... 

Keywords: database programming languages, orthogonal persistence, persistent 
application systems, persistent programming languages 

19 The proposed new Computing Reviews classification scheme Q 
Anthony Ralston 

July 1981 Communications of the ACM, Volume 24 issue 7 

Full text available: 1 pdf(972.02 Additiona| | nformation: fu || citation , citings , index terms 
KB) 



20 Discourse segmentation by human and automated means 
Rebecca J. Passonneau, Diane J. Litman 
March 1997 Computational Linguistics, Volume 23 issue l 

Full text available: ^ [fjj] 

*Pj)paf(2.7l MB) tO 1 Additional Information: full citation , abstract , references , citings 
Publisher Site 

The need to model the relation between discourse structure and linguistic features of 
utterances is almost universally acknowledged in the literature on discourse. However, there 
is only weak consensus on what the units of discourse structure are, or the criteria for 
recognizing and generating them. We present quantitative results of a two-part study using 
a corpus of spontaneous, narrative monologues. The first part of our paper presents a 
method for empirically validating multitutterance units ... 
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21 Concepts and capabilities of a database computer\ 
Jayanta Banerjee, David K. Hsiao, Richard I. Baum 

December 1978 ACM Transactions on Database Systems (TODS), Volume 3 issue 4 

Additional Information: full citation , abstract , references, citings , index 
terms 



Full text available: Q pdf(2.79 MB) 



The concepts and capabilities of a database computer (DBC) are given in this paper. The 
proposed design overcomes many of the traditional problems of database system software 
and is one of the first to describe a complete data-secure computer capable of handling 
large databases. This paper begins by characterizing the major problems facing today's 
database system designers. These problems are intrinsically related to the nature of 
conventional hardware and can only be solved by i ... 

Keywords: clustering, content-addressable memory, database computers, keywords, mass 
memory, performance, security, structure memory 



22 The PSG system: from formal language definitions to interactive programming 
environments 
Rolf Bahlke, Gregor Snelting 

August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 



Volume 8 Issue 4 
Full text available: ^pdf(2.35MB) 



Additional Information: full citation , abstract , references , citings , index 
terms , review 



The PSG programming system generator developed at the Technical University of Darmstadt 
produces interactive, language-specific programming environments from formal language 
definitions. All language-dependent parts of the environment are generated from an entirely 
nonprocedural specification of the language's syntax, context conditions, and dynamic 
semantics. The generated environment consists of a language-based editor, supporting 
systematic program development by named program fragments, ... 

23 Technical reports 
SIGACT News Staff 

January 1980 ACM SIGACT News, Volume 12 issue l 

Full text available: Qpdf(5.28 MB) Additional Information: full citation 



24 Applications of combinatorial designs in computer science 
Charles J. Colbourn, Paul C. van Oorschot 
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June 1989 ACM Computing Surveys (CSUR), Volume 21 issue 2 

Full text available: pdf(2.99 MB) Additional Information: full citation , abstrad, references , citings, index 
tSh ^ terms , review 

The theory of combinatorial designs has been used in widely different areas of computation 
concerned with the design and analysis of both algorithms and hardware. Combinatorial 
designs capture a subtle balancing property that is inherent in many difficult problems and 
hence can provide a sophisticated tool for addressing these problems. The role of 
combinatorial designs in solving many problems that are basic to the field of computing is 
explored in this paper. Case studies of many applicat ... 

25 Concurrent search of mobile users in cellular networks 
Rung-Hung Gau, Zygmunt J. Haas 

February 2004 IEEE/ACM Transactions on Networking (TON), Volume 12 issue l 

Full text available: ^pdf(381.70 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, we propose to concurrently search for a number of mobile users in a wireless 
cellular network based on the probabilistic information about the locations of mobile users. 
The concurrent search approach guarantees that all k mobile users will be located within k 
time slots. It is shown that even in the worst case when mobile users appear equally in all 
the cells of the network, the concurrent search approach is able to reduce the average 
paging cost by 25%. More import ... 

Keywords: cellular networks, concurrent search, probabilistic location information 



26 Searching in metric spaces with user-defined and approximate distances | 
Paolo Ciaccia, Marco Patella 

December 2002 ACM Transactions on Database Systems (TODS), volume 27 issue 4 

Full text available: j ppdf(555.89 KB) Additional Information: full citation , abstract, references , dtjnas, index 

terms 

Novel database applications, such as multimedia, data mining, e-commerce, and many 
others, make intensive use of similarity queries in order to retrieve the objects that better fit 
a user request. Since the effectiveness of such queries improves when the user is allowed to 
personalize the similarity criterion according to which database objects are evaluated and 
ranked, the development of access methods able to efficiently support user-defined 
similarity queries becomes a basic requirement. In t ... 

Keywords: Distance metrics, user-defined queries 



27 Information retrieval on the web 
Mei Kobayashi, Koichi Takeda 

June 2000 ACM Computing Surveys (CSUR), Volume 32 issue 2 

Full text available- « Pdf(213.89 KB) Additional Information: full citation , abstract, references , cjtiogs, index 

terms 

In this paper we review studies of the growth of the Internet and technologies that are 
useful for information search and retrieval on the Web. We present data on the Internet 
from several different sources, e.g., current as well as projected number of users, hosts, 
and Web sites. Although numerical figures vary, overall trends cited by the sources are 
consistent and point to exponential growth in the past and in the coming decade. Hence it is 
not surprising that about 85% of Internet user ... 

Keywords: Internet, World Wide Web, clustering, indexing, information retrieval, 
knowledge management, search engine 
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29 Query clustering using user logs 

January 2002 ACM Transactions on Information Systems (TOIS), volume 20 issue l 

r- H4 i ui a ma oa Additional Information: full citation, abstract , references , citings , index 

Full text available: Ta pdu 1.31 MB) 3 

terms , review 

Query clustering is a process used to discover frequently asked questions or most popular 
topics on a search engine. This process is crucial for search engines based on question- 
answering. Because of the short lengths of queries, approaches based on keywords are not 
suitable for query clustering. This paper describes a new query clustering method that 
makes use of user logs which allow us to identify the documents the users have selected for 
a query. The similarity between two queries may be ded ... 

Keywords: Query clustering, search engine, user log, web data mining 



30 DEADLINER: building a new niche search engine Q 
A. Kruger, C. L. Giles, F. M. Coetzee, E. Glover, G. W. Flake, S. Lawrence, C. Omlin 
November 2000 Proceedings of the ninth international conference on Information and 
knowledge management 

Full text available: ^ pdf(340.59 KB) Additional Information: full citation , references , citings , index terms 



Keywords: Bayesian fusion approaches, text extraction, web search 



31 On-line Text Editing: A Survey 
Andries van Dam, David E. Rice 

September 1971 ACM Computing Surveys (CSUR), Volume 3 issue 3 

Full text available* 13sl odfd 91 MB) Additional Information: full citation, abstract, references, citings, index 
* ^ 1 terms 

This paper is a survey of current methods for the on-line creation and editing of computer 
programs and of ordinary manuscripts text. The characteristics of on-line editing systems 
are examined and examples of various implementations are described in three categories: 
program editors, text editors, and terminals with local editing facilities. 

32 Model checking without a model: an analysis of the heart-beat monitor of a telephone 
switch using VeriSoft 

Patrice Godefroid, Robert S. Hanmer, Lalita Jategaonkar Jagadeesan 

March 1998 ACM SIGSOFT Software Engineering Notes , Proceedings of the 1998 ACM 
SIGSOFT international symposium on Software testing and analysis, volume 

23 Issue 2 

Full text available: 1 1pdff1.15 MB) Additional Information: full citation , abstract, references , citings, index 
13 terms 

VeriSoft is a tool for systematically exploring the state spaces of systems composed of 
several concurrent processes executing arbitrary code written in full-fledged programming 
languages such as C or C+ + . The state space of a concurrent system is a directed graph 
that represents the combined behavior of all concurrent components in the system. By 
exploring its state space, VeriSoft can automatically detect coordination problems between 
the processes of a concurrent system. We report in this pape ... 

33 Workshop on compositional software architectures: workshop report 
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34 Knowledge and representation: Leveraging a common representation for personalized Q 
search and summarization in a medical digital library 

Kathleen R. McKeown, Noemie Elhadad, Vasileios Hatzivassiloglou 

May 2003 Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries 

r- ii* ^ i ui 0i *o l<d\ Additional Information: full citation , abstract , references , citings, index 

Full text available: tja pdfn 16.18 KB) 5 *-' 

l£3 ~^ terms 

Despite the large amount of online medical literature, it can be difficult for clinicians to find 
relevant information at the point of patient care. In this paper, we present techniques to 
personalize the results of search, making use of the online patient record as a sophisticated, 
pre-existing user model. Our work in PERSIVAL, a medical digital library, includes methods 
for re-ranking the results of search to prioritize those that better match the patient record. 
It also generates summa ... 

35 Special issue on word sense disambiguation: Introduction to the special issue on word Q 
sense disambiguation: the state of the art 

Nancy Ide, Jean Veronis 

March 1998 Computational Linguistics, Volume 24 issue l 

Full text available: = A . (9 

I^PdUo.^lviBj^O 1 Additional Information: full citation, references , citings 

Publisher Site 



36 Bandwidth-delay constrained path selection under inaccurate state information 
Turgay Korkmaz, Marwan Krunz 

June 2003 IEEE/ACM Transactions on Networking (TON), volume n issue 3 

Full text available: ^ pdf(976.94 KB) Additional Information: full citation , abstract , references , index terms 

One of the key issues in any quality-of-service (QoS) routing framework is how to compute 
a path that satisfies given QoS constraints. In this paper, we focus on the path computation 
problem subject to the bandwidth and delay constraints. This problem can be easily solved if 
the exact state information is available to the node performing the path computation 
function. In practice, however, nodes have only imprecise knowledge of the network state. 
The reliance on outdated information and treating ... 

Keywords: Lagrange relaxation, multiobjective optimization, quality-of-service (QoS) 
routing, stochastic shortest path 
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Norman Meyrowitz, Andries van Dam 

September 1982 ACM Computing Surveys (CSUR), volume 14 issue 3 
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38 Distributed file systems: concepts and examples 
Eliezer Levy, Abraham Silberschatz 

December 1990 ACM Computing Surveys (CSUR), Volume 22 issue 4 

Full text available: ^ pdf(5.33 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms , review 

The purpose of a distributed file system (DFS) is to allow users of physically distributed 
computers to share data and storage resources by using a common file system. A typical 
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configuration for a DFS is a collection of workstations and mainframes connected by a local 
area network (LAN). A DFS is implemented as part of the operating system of each of the 
connected computers. This paper establishes a viewpoint that emphasizes the dispersed 
structure and decentralization of both data and con ... 

39 Parallelism in relational data base systems: architectural issues and design 
approaches 

Hamid Pirahesh, C. Mohan, Josephine Cheng, T. S. Liu, Pat Selinger 
July 1990 Proceedings of the second international symposium on Databases in 
parallel and distributed systems 

Full text available: 1 ^Ddf(2.50 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

With current systems, some important complex queries may take days to complete because 
of: (1) the volume of data to be processed, (2) limited aggregate resources. Introducing 
parallelism addresses the first problem. Cheaper, but powerful computing resources solve 
the second problem. According to a survey by Brodie,l only 10% of computerized data is in 
data bases. This is an argument for both more variety and volume of data to be moved into 
data base systems. We conject ... 

40 Structuring Z specifications with views 
Daniel Jackson 

October 1995 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 4 Issue 4 

Full text available- 1Sl Ddf(1 66 MB) Additional Information: full citation , abstract , references , citings , index 

terms , review 

A view is a partial specification of a program, consisting of a state space and a set of 
operations. A full specification is obtained by composing several views, linking them through 
their states (by asserting invariants across views) and through their operations (by defining 
external operations as combinations of operations from different views). By encouraging 
multiple representations of the program's state, view structuring lends clarity and terseness 
to the specification of operations. ... 

Keywords: Z, formal specification, implicit definition, views 
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1 Fast detection of communication patterns in distributed executions 
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November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: ^pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 
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An empirically based system for processing definite descriptions 
Renata Vieira, Massimo Poesio 

December 2000 Computational Linguistics, Volume 26 issue 4 

Full text available: „ rfjj] 

HH paf(2.94 MB) Additional Information: full citation , abstract , references 
Publisher Site 

We present an implemented system for processing definite descriptions in arbitrary 
domains. The design of the system is based on the results of a corpus analysis previously 
reported, which highlighted the prevalence of discourse-new descriptions in newspaper 
corpora. The annotated corpus was used to extensively evaluate the proposed techniques 
for matching definite descriptions with their antecedents, discourse segmentation, 
recognizing discourse-new descriptions, and suggesting anchors for brid ... 

Clustering: Probabilistic combination of text classifiers using reliability indicators: 
models and results 

Paul N. Bennett, Susan T. Dumais, Eric Horvitz 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: ^ pdfd 26.99 KB) Additional Information: full citation , abstract , references , index terms 
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The intuition that different text classifiers behave in qualitatively different ways has long 
motivated attempts to build a better metaclassifier via some combination of classifiers. We 
introduce a probabilistic method for combining classifiers that considers the context- 
sensitive reliabilities of contributing classifiers. The method harnesses reliability indicators— 
variables that provide a valuable signal about the performance of classifiers in different 
situations. We provide backgrou ... 

Keywords: classifier combination, metactassifiers, reliability indicators, text classification 
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Daniel C. Swinehart, Polle T. Zellweger, Richard J. Beach, Robert B. Hagmann 

August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 8 Issue 4 

Full text available: HB Ddf(6.32 MB) Additional Information: full citation , abstract, references , citings, index 
1 terms 

This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are organized. 
Cedar supports the development of programs written in a single programming language, 
also called Cedar. Its primary purpose is to increase the productivity of programmers whose 
activities include experimental programming and the development of prototype software 
systems for a high-performance personal computer. T ... 

IS '97: model curriculum and guidelines for undergraduate degree programs in 
information systems 

Gordon B. Davis, John T. Gorgone, J. Daniel Couger, David L. Feinstein, Herbert E. 
Longenecker 

December 1996 ACM SIGMIS Database , Guidelines for undergraduate degree programs 
on Model curriculum and guidelines for undergraduate degree 
programs in information systems, Volume 28 issue l 

Full text available: ^ pdf(7.24 MB) Additional Information: full citation, citings 
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Query evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), Volume 25 issue 2 

r- II* ^ i ui dPt ,, /ft „ ym Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(9,37 MB) r 
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Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to 
provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 



8 Interactive Editing Systems: Part II Q 
Norman Meyrowitz, Andries van Dam 
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9 Incremental data flow analysis in a structured program editor Q 
Frank Kenneth Zadeck 

June 1984 ACM SIGPLAN Notices , Proceedings of the 1984 SIGPLAN symposium on 

Compiler construction, Volume 19 issue 6 
Full text available: ^ pdf(1.03 MB) Additional Information: full citation , abstract, references , citings 

For many years, the generally accepted methods for solving global data flow analysis 
problems (GDFAPs) have relied on the bit vector paradigm [Alle 70] [Kenn 81]. In these 
algorithms, the information for each variable is represented as a position in a bit vector. 
These vectors are propagated from node to node during the analysis of the program.This 
paper investigates a different paradigm for solving GDFAPs. In this strategy, the data flow 
solution for each variable is computed independent ... 

10 Program Transformation Systems Q 
H. Partsch, R. Steinbruggen 

September 1983 ACM Computing Surveys (CSUR), Volume 15 issue 3 

Full text available: ^ pdfP.OO MB) Additional Information: full citation , references , citings, index terms 



11 Curriculum 68: Recommendations for academic programs in computer science: a 
report of the ACM curriculum committee on computer science 

William F. Atchison, Samuel D. Conte, John W. Hamblen, Thomas E. Hull, Thomas A. Keenan, 
William B. Kehl, Edward J. McCluskey, Silvio O. Navarro, Werner C. Rheinboldt, Earl J. 
Schweppe, William Viavant, David M. Young 
March 1968 Communications of the ACM, Volume 11 issue 3 

Full text available: ^ pdf(6.63 MB) Additional Information: full citation , references , citings 
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12 Data base directions: the next steps Q 
John L. Berg 

November 1976 , volume 8 , 8 issue 4 , 2 

Full text available: ^ pdf(9.95 MB) Additional Information: full citation , abstract 

What information about data base technology does a manager need to make prudent 
decisions about using this new technology? To provide this information the National Bureau 
of Standards and the Association for Computing Machinery established a workshop of 
approximately 80 experts in five major subject areas. The five subject areas were auditing, 
evolving technology, government regulations, standards, and user experience. Each area 
prepared a report contained in these proceedings. The proceedings p ... 

Keywords: DBMS, auditing, cost/benefit analysis, data base, data base management, 
government regulation, management objectives, privacy, security, standards, technology 
assessment, user experience 



13 Using thumbnails to search the Web 

Allison Woodruff, Andrew Faulring, Ruth Rosenholtz, Julie Morrsion, Peter Pirolli 
March 2001 Proceedings of the SIGCHI conference on Human factors in computing 
systems 

Full text available* 153 Ddf(424 1 1 KB) Add ' tiona l Information: full citation , abstract , references , citings , index 
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We introduce a technique for creating novel, textually-enhanced thumbnails of Web pages, 
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These thumbnails combine the advantages of image thumbnails and text summaries to 
provide consistent performance on a variety of tasks. We conducted a study in which 
participants used three different types of summaries (enhanced thumbnails, plain 
thumbnails, and text summaries) to search Web pages to find several different types of 
information. Participants took an average of 67, 86, and 95 seconds to f ... 

Keywords: Web search task, thumbnails 



14 Automatic parsing for content analysis Q 
Frederick J. Damerau 

June 1970 Communications of the ACM, volume 13 issue 6 

Full text available: ^ pdf(4.07 MB) Additional Information: full citation , abstract , references , citings 

Although automatic syntactic and semantic analysis is not yet possible for all of an 
unrestricted natural language text, some applications, of which content analysis is one, do 
not have such a stringent coverage requirement. Preliminary studies show that the Harvard 
Syntactic Analyzer can produce correct and unambiguous identification of the subject and 
object of certain verbs for approximately half of the relevant occurences. This provides a 
degree of coverage for content analysis variable ... 

Keywords: content analysis, information retrieval, language analysis, natural language 
processing, parsing, syntactic analysis, text processing 



15 Types and persistence in database programming languages 
Malcolm P. Atkinson, O. Peter Buneman 

June 1987 ACM Computing Surveys (CSUR), volume 19 issue 2 

i- in ^ *i ui 0 -i*/-7 ^4 Additional Information: full citation , abstract , references , citings , index 

Full text available: Tja] pdf(7.91 MB) 

terms , review 

Traditionally, the interface between a programming language and a database has either 
been through a set of relatively low-level subroutine calls, or it has required some form of 
embedding of one language in another. Recently, the necessity of integrating database and 
programming language techniques has received some long-overdue recognition. In 
response, a number of attempts have been made to construct programming languages with 
completely integrated database management systems. These lang ... 

16 News on-demand for multimedia networks 
Gene Miller, Greg Baber, Mark Gilliland 

September 1993 Proceedings of the first ACM international conference on Multimedia 

Full text available: fjl pdf(106.03 KB) *.*„■♦.. , v ♦ 
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Keywords: authoring, database, digital video, information service, messaging, networked 
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17 Special issue on persistent object systems: Orthogonally persistent object systems 
Malcolm Atkinson, Ronald Morrison 

July 1995 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 4 Issue 3 

Full text available: pdf(5.02 MB) Additional Information: full citation , abstract , references , citings 

Persistent Application Systems (PASs) are of increasing social and economic importance. 
They have the potential to be long-lived, concurrently accessed, and consist of large bodies 
of data and programs. Typical examples of PASs are CAD/CAM systems, office automation, 
CASE tools, software engineering environments, and patient-care support systems in 
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hospitals. Orthogonally persistent object systems are intended to provide improved support 
for the design, construction, maintenance, and operation o ... 

Keywords: database programming languages, orthogonal persistence, persistent 
application systems, persistent programming languages 
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Anthony Ralston 

July 1981 Communications of the ACM, volume 24 issue 7 
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19 Parallel text search methods 
Gerard Salton, Chris Buckley 

February 1988 Communications of the ACM, Volume 31 issue 2 
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Full text available: TO pdf(1.53 MB) 
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A comparison of recently proposed parallel text search methods to alternative available 
search strategies that use serial processing machines suggests parallel methods do not 
provide large-scale gains in either retrieval effectiveness or efficiency. 

20 Discourse segmentation by human and automated means 
Rebecca J. Passonneau, Diane J. Litman 
March 1997 Computational Linguistics, Volume 23 issue l 

Full text available: 



^ pdf(2.71 MB) %1P Additional Information: full citation , abstract , references , citings 
Publisher Site 

The need to model the relation between discourse structure and linguistic features of 
utterances is almost universally acknowledged in the literature on discourse. However, there 
is only weak consensus on what the units of discourse structure are, or the criteria for 
recognizing and generating them. We present quantitative results of a two-part study using 
a corpus of spontaneous, narrative monologues. The first part of our paper presents a 
method for empirically validating multitutterance units ... 
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